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1. DESCRIPTION OF THE GSPRT 


Let p(@,x) be a parametrized distribution and let L(@,x) = Sy log p(@, zi) be 
the corresponding log-likelihood function for N independent trials. 

Let ¢(@) be a function of the parameters and assume given ¢o9, ¢; € R. The 
Generalized Sequential Probability Likelihood Ratio Test [1] for ¢ = do versus 
@ = ¢1 is based on monitoring the difference 


(1.1) LLR := L(6,,x) — L(6p,2) 


where 9, is the maximum likelihood estimator for 9 subject to the condition ¢(6,;) = 
dj. Le. 


6, = arg max L(6, 2) 
$(@)=¢: 


Like for the SPRT the test stops when LLR leaves the interval [log(6/(1—a)), log((1— 
8)/a)| where as usual a, 8 stand for the Type I,II error probabilities. 


2. AN APPROXIMATION 


Let 6 be the unconstrained maximum likelihood estimator for @, i.e. 


d= arg max L(6, x) 
8 


and put ¢ = 4(@). Let V(é) be an estimator for the variance Var(#) of ¢ with 
relative error O(1/VN) (thus any standard estimator will do). We claim that under 
suitable regularity conditions we have the following very convenient approximation 
for (1.1) 


(2.1) L(6,,2) _ L (8,2) ~ ; (1 — ze bo — ¢1) 


This means that the, sometimes quite cumbersome, calculation of the conditional 
estimates 6), 8, is not actually required. 


Example. Assume we take independent trials from a multinomial distribution with 


after N trials the outcome frequencies are (N;)i=1,....n (with N =)>_, Ni). Then 
to calculate (2.1) we calculate first the empiric probabilities p; := N;/N and then 
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we put 


This example is relevant for chess engine testing [2] in which case ¢ would stand for 
the expected score of a match. For the naive trinomial model one takes (p1, po, p3) = 
(w,d,l) and a, = 1, ag = 1/2, ag = 0 whereas in the 5-nomial model (for paired 
games with reversed colors) one takes (p1, p2, P3, Pa, Ps) = (P2,P3/2; P15 P1/2, Po) with 
a, = 1, ag = 3/4, ag = 1/2, ag = 1/4, a5 = 0. Note that in the 5-nomial model N 
is the number of games divided by two (one trial consists of two games). 


3. DERIVATION OF THE APPROXIMATION 


For simplicity we will give the derivation for one particular choice of V(¢). One 
may check that the relative change in the right hand side of (2.1), when replacing 
one V(@) by another, goes to zero when N goes to infinity. 

In order to verify (2.1) the first mission is to calculate 6;. Using Lagrange 


multipliers we see that we have to solve (i € {0,1}) 
VoL(8;,2) = AVoo(G;) 
(8;) = 9 
If 6 = d; then \ = 0, 6, = 6. We will assume that db is close to ¢; so that A is small. 
We get in first order (H = Hessian) 
(HoL(8, x) — \Heo(8)) - (8; — 8) = AVod(@) 
6 


Voo(8)'- (6, -0) = 4-6 


and hence 


(3.1) re ot ae ever 
AV oG(8)' » HoL(@,x)~* - Vog(8) = $i — ¢ 
Write V(¢) = —Vo(6)' - Ho L(G, 2)! - Voo(6). It is well-known that V(¢) is an 


approximation for the variance Var(d) of ¢. Then we get by eliminating \ from 
(3.1) 


6,-4=-* 9 HpL(0,2)-* - Vo9(d) 
V(9) 
Now we have (using the fact that @ is extremal for L(@,x)) 
A “ 1 


L(6,;,2) & L(6, x) + =(6, — 6)* - HoL(6, x) - (6, — 9) 


3 
A 1 (¢; — $)? 
6,2) — 5 


V(9) 
Substituting this in (1.1) yields what we want. 
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